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Abstract 

We present a variety of new results on finite sets A of integers for 
which the suniset A + A is larger than the difference set A — A, so- 
called MSTD (more sums than differences) sets. First we show that 
there is, up to afhne transformation, a unique MSTD subset of Z of 
size 8. Secondly, starting from some examples of size 9, we present 
several new constructions of infinite families of MSTD sets. Thirdly 
we show that for every fixed ordered pair of non- negative integers (j, k), 
as n —>■ oo a positive proportion of the subsets of {0, 1, 2, n} satisfy 
\A + A\^ {2n + 1) - J, \A- A\ = {2n + 1) - 2k. 

1. Introduction 

If A C Z the sumset and difference set of A are defined, respectively, as 

j4 + j4 := {x G Z : x = oi + 02 for some ai, 02 G A}, 
A — A := {x ^ 7i : X = ai — a2 for some ai, 02 G A}. 

If ^ is a finite set with n elements then it is easy to see that, a priori, 

n(n + 1) 

2n-l< A + A < ^ \ 
- I I - 2 ' 

2n-l <\A- A\<n{n-l) + 1. 

The upper bounds follow simply from the fact that addition is commutative 
whereas subtraction is not, and this certainly suggests that 'most' (speaking 
informally) finite sets A should have the property that l^ + ^l < |^ — Aj. 
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A precise result in support of this intuition was proven by Roesler [6] : he 
showed that for any n > and 1 < k < n, the average value of the quotient 
1^ — + A\, as A runs over the fc-clement subsets of {0, 1, lies 

in the interval [1,2). A question of murky origins, but dating back to the 
1960s, is whether there exist any finite sets A of integers at all such that 
l^ + ^l > 1^ — 74|. The question is interesting because of the apparent dif- 
ficulty in finding examples of such sets, whereas it is very easy to construct 
sets A with |^-|-^| = |^ — ^|. Any arithmetic progression has this property 
or, more generally, any set A with the property that A = {x} — A for some 
X G Z. Such a set is said to be symmetric (about x/2). 

Following the practice introduced by Nathanson [3], we refer to sets hav- 
ing more sums than differences as MSTD sets. Some examples of such 
sets appear in the literature from the late 1960s and early 1970s. Con- 
way is said to have found the example Ai = {0,2,3,4,7,11,12,14}. The 
example A2 = {0,1,2,4,7,8,12,14,15} appears in [2]. Another interest- 
ing example, A3 = {0, 1, 2, 4, 5, 9, 12, 13, 14, 16, 17, 21, 24, 25, 26, 28, 29}, ap- 
pears in [1]. Note that for i = 1,2, \Ai + Ai\ — \Ai — Ai\ = 1, whereas 

1^3 + ^31 - |^3--43| =4. 

The following two observations are pertinent : 

1. The property of being an MSTD set is invariant under linear mappings 
X i-^ ux + V, u,v € Z, u ^ 0. The sizes of sum- and difference sets are un- 
changed by such mappings. In particular, it suffices to consider finite subsets 
of Z which have smallest element 0, and such that the greatest common di- 
visor of the elements in the set is 1. Such sets will be called normalised. 

2. Starting from any MSTD set A we can construct a sequence A = 
Ai,A2,... oi MSTD sets such that the quotients |At -|- - At| become 
arbitrarily large. Choose an integer m and set 



If m is sufficiently large, then |ylj ± Aj| = |yl ± This method of con- 
structing an infinite family of MSTD sets from a single one will be called 
the base expansion method. 

In particular, these observations imply that any MSTD set gives rise to 
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an infinite family of such sets. It appears to have been an open problem 

for some time to find some other way of constructing an infinite family of 
MSTD sets. In [10], Rusza uses probabilistic arguments to prove the ex- 
istence of a multitude of MSTD sets. However, this still does not provide 
explicit constructions. Extending an observation of the author and Roesler, 
such constructions were eventually provided by Nathanson [3]. The idea 
for his type of construction comes from examples like Ai above. Note that 
that set is the union of a symmetric set {0, 2, 3, 7, 11, 12, 14} and the single 
number 4. The symmetric part contains an arithmetic progression {3, 7, 11}, 
with some extra numbers tagged on at both ends. Nathanson's sets have 
this type of structure. One starts with a (proper generalised) arithmetic 
progression, adds on some structure at both ends while retaining symmetry, 
then adds one further element which results in the sumset being enlarged 
by one element while the difference set is left unchanged. 

Nathanson's paper also uses a probabilistic method (inspired by Tao) to 
prove the existence of many MSTD sets in finite abelian groups of the form 
Z/nZ X Z/2Z, and presents a general method for transforming MSTD sets 
in finite abelian groups to MSTD sets in Z. Even more recently, Martin and 
O'Bryant [5] also use probabilistic methods to prove the following impres- 
sive result : there is a positive constant c such that, for all n >> 0, at least 
c • 2'*+^ subsets of {0, 1, 2, n} are MSTD sets. 

In summary, probabilistic methods have shown that the phenomenon of 
MSTD sets is actually quite common. But explicit constructions of such 
sets remain hard to come by, with Nathanson's idea being essentially the 
only one in print. This issue of explicitly constructing MSTD sets will be our 
primary concern here. We were motivated by one of the questions Nathanson 
posed in his talk [4], namely whether there existed any MSTD sets in Z of 
smaller cardinality than Ai above. Our first result is 

Theorem 1 There are no MSTD subsets of Z of size seven. Moreover, 
up to linear transformations, Ai is the unique such set of size 8. 

Clearly, the classification of all MSTD sets of a given size, up to linear 
transformation, is a finite computation. To reduce the complexity of the 
computation to something manageable, even for very small sizes, is quite 
another matter. Our method accomplishes this for sets of size 8, but even 
for size 9, the computation was not feasible. We did manage to find all 
MSTD sets A of size 9 with the following property : for some element x of 
the sumset A + A there are at least four ordered pairs (ai, 02) € A x ^4 with 
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oi + 02 = X. There are exactly nine such sets up to hnear transformation, 
namely 

^4 = {0,1, 2, 4, 5, 9, 12, 13, 14}, 
^2 = {0,1, 2, 4, 7, 8, 12, 14, 15}, 
^5 = {0,2, 3, 4, 7, 9, 13, 14, 16}, 
^6 = {0,2, 3, 4, 7, 11, 12, 14, 16}, 

Ay = {0,2,3,4,7,11,15,16,18}, 

As = {0,2,4,8,9,10,15,17,19}, 

^9 = {0,4, 6, 7, 8, 14, 15, 17, 21}, 
Aio = {0, 5, 6, 9, 10, 13, 16, 17, 22}, 
^11 = {0, 4, 6, 8, 11, 14, 19, 21, 25}. 

Note that A2 is the same set as written earlier with the same name. A4 
is a subset of ^3. The sets ^4,^5,^7 appear in [3]. Of these, A4 and A^ 
are, along with Ai , among the infinite family of MSTD sets constructed in 
Theorem 1 of that paper, whereas A5 is among the family of MSTD sets de- 
scribed in Theorem 2 there. The remaining sets Aq, Aq, Ag, Aiq,Aii appear 
to be new. Working from these examples we will present four constructions 
of infinite families of MSTD sets (Theorems 2/3,4,5,6 below) which gener- 
alise respectively Au/Aq, Ag, Aq and ^14. All constructions share common 
features with, but are nevertheless different from in a non-trivial sense, those 
in [3] and one another. This reflects the main theme of the paper, namely 
that while the most easily describable MSTD sets all seem to have a com- 
mon core of features, within this framework there is substantial room for 
variety. The last part of the paper deals with the question 

'How much larger can the sumset be than the difference set ?' 

On the one hand, we will answer a question in [5] (a weaker version was 
posed in [3]) by showing (Theorem 8) that for every pair j, k of non- negative 
integers there is a positive constant cj^k such that, for all n »j,k 0, at least 
Cj-fc • 2"+i of the subsets A of {0, 1, ...,n} satisfy \A + A\ = (2n + 1) - j, 
\A — A\ = {2n + 1) — 2k. The proof, which also provides explicit examples 
for every j and k, involves two ideas : firstly, extending the probabilistic 
method in [5], and secondly relating MSTD sets in Z to MSTD sets in 
suitably chosen finite cyclic groups. 

On the other hand, the base expansion method suggests that the right 
quantity to look at when studying the above question is not |^+^| — |^ — ^| 
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but rather 



(1) 



Results due to Freiman-Pigarev [1] and Rusza [7,8,9], establish that 

< HA) < 1, 

for any finite set j4 C Z. It is not known if either bound is sharp, and the 
state of knowledge is far worse for the upper bound. Rusza's probabilistic 
method [10] produces a constant c > 1 such that there are a 'multitude' of 
sets A with f{A) > c. He doesn't compute c explicitly, but a quick analysis 
of his method shows that it gives c ~ 1 + 10~^. The set A3 above satis- 
fies /(^s) = = 1.0175... and there appears to be nothing in print which 
beats this. We will give explicit examples of sets which do so, if only slightly. 



The remainder of the paper is organised as follows. In Section 2 we will 
prove Theorem 1. A mathematica code was written to perform the compu- 
tations necessary to complete the proof. In Section 3 we will first indicate 
the computations performed which allowed us to conclude that the list of 
sets 742,...,^!! was complete in the sense mentioned above. Then we will 
prove Theorems 2-6. In Section 4 we will prove Theorem 8 and exhibit the 
sets A with larger values of f{A) than anything previously written down. In 
Section 5, we will give some concluding remarks and suggestions for further 
investigations. 



2. Proof of Theorem 1 



Clearly, it suffices to prove the second statement of the theorem. The loca- 
tion of all MSTD subsets of Z of a certain size can be represented as a finite 
computation as follows : 

Let A be a set of size n > 0, with A = {oj : i = l,...,n} and oi > 
a2 > • • • > a„ = 0. For i = 1, ...,n — 1, represent the difference — Oj+i 
as Cj, the i:th. standard basis vectoi0 in R""^. A computer program, if 
now asked to compute those quantities, will return [^ + ^41 = n{n + l)/2, 
[A — A| = n(n — 1) + 1. So if A is to be an MSTD set, there must be 
a non-trivial coincidence of differences. That is, there must exist i,j,k,l 

^Actually, in our computer program we represent ai — Ui+i initially by ek-i- 
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such that Oj — aj = ak — ai and ^ {k,l). Given such an equation we 
can, by projection onto the orthogonal complement in R"~^ of the subspace 
spanned by (cj — ej) — (e^ — e}), now represent the elements of A by vectors 
in R""2 and recompute \A + A\ and |^ - If stiU |A + ^41 < |A - A\ we 
can pick another non-trivial identification of elements in A — A, and repeat 
the above procedure, with the elements of A now represented by vectors in 
Clearly, the computation will terminate with all MSTD sets of size n 
and smallest element zero located, possibly including infinite parameterised 
families of such sets. 

The above computation seems to be practically rather unfeasible even for 
n = 8 however. We estimate that our machine would have taken several 
weeks at least to finish the calculation. For n = 8 one starts with vectors 
in R^. It turns out however that, with a modest amount of argument, one 
can show that if A is an MSTD set, then there must appear one of 18 pos- 
sible configurations, each of which reduces the problem to R^ or R^. We 
thus allowed our program to instead examine each of these 18 possibilities 
in turn, and the average running time was about 45 minutes. Though the 
argument used to simplify matters could be pushed further, it wasn't ob- 
vious to us how to do so without an effort which would essentially balance 
out the resulting reduction in computing time. Thus our proof of Theorem 
1 will consist of two parts : 

Part One : reduction to 18 possible cases as described above. 

Part Two : a computer program to search through all these cases in turn. 

Part One 

As before, let A C Z be a set of size 8, A = {oj : i = 1,...,8} where 
oi > a2 > • • • > ag = 0. We introduce some further notation. Let 

S:={{ai,aj) : 1 < i < j < 8}, 
S := A + A = {ai + aj : {ai, aj) € S}, 
D:={{ai,aj):l<i<3<8}, 
V := {A - A)>o = {ai - aj : {ai, aj) G D}. 

Let TZi and TZ2 be the equivalence relations on S and D respectively defined 

by 

{{ai, aj), {ak, ai)} e Ui <^ ai + aj = + a/, 
{{tti, aj), {ak, ai)} G 7^2 <^=^ Oi - aj = ak - ai- 
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Let S := S/TZi and A := Obviously one can identify S with <S and 

A with T>, but it will be convenient for us to have a separate notation when 
referring to the equivalence relations. Let |S| := a and |A| := 5. Thus 
1^ + ^1 = a and \A- A\ = 25 + 1. So if A is an MSTD set then 

25 + Ka. (2) 

Definition : An equivalence class in S will be said to be nice if it contains 
at least three elements {ai,aj), {ak,ai), {am^o-n) such that all six indices 
i, J, k, I, m, n are distinct. 

One readily checks that if there is a nice S-class, then A must contain, 
up to symmetry, one of the following 16 configurations : 



ai + as = 


a2 + ay 


= as + ae, 


(3) 


ai + as = 


a2 + ay 


= as + a5. 


(4) 


ai + as = 


a2 + ay 


= as + a4. 


(5) 


ai + as = 


a2 + ay 


= a4 + as. 


(6) 


ai + as = 


a2 + ae 


= as + as. 


(7) 


ai + as = 


a2 + ae 


= as + a4. 


(8) 


ai + as = 


a2 + ae 


= a4 + as. 


(9) 


ai + as = 


02 + 05 


= 03 + a4, 


(10) 


ai + as = 


as + o-e 


= 04 + as, 


(11) 


ai + a7 = 


a2 + ae 


= 03 + 05, 


(12) 


ai + 07 = 


a2 + uq 


= as + a4. 


(13) 


ai + ay = 


a2 + ae 


= a4 + as. 


(14) 


ai + ay = 


a2 + 05 


= as + a4. 


(15) 


ai + ay = 


as + ae 


= a4 + as. 


(16) 


di + ae = 


a2 + 05 


= as + a4. 


(17) 


a2 + ay = 


: as + ae 


= 04 + as. 


(18) 



Let ni < n2 < ■ ■ ■ < ns he the sizes of the A-classes, arranged in some 
increasing order. Hence 

5 

J2ni = \D\ = 28. (19) 

i=l 

Note that ni = 1 since (01,03) is in a class by itself. If {ai,aj) is 7^2- 
equivalent to {ak,ai), where i < k < j < I, then {ai,ai) is T^i-equivalent 
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to {ak,aj). This defines a mapping cj) from 7?.2-equivalent pairs of elements 
of D to 7?.i-equivalent pairs of elements of S. The mapping is obviously at 

most 2-1, hence its range consists of at least ^ Y,i=i ^ ^* ^ pairs. If, in 

addition, there are no nice S-classes, it follows that 

We examine the following two cases : 
Case I : 712 > 1- 

Case II : n2 = 1 and there are no nice S-classes. 

First consider Case I. Then every A-class, other than that consisting of 
the single pair (ai, ag), contains at least two members of D. First of all this 
forces ai — 07 = 02 — ag, hence 

ai + as = a2 + aj. (21) 

Next consider the three differences ai — ag, 02 — a-j and 03 — og. The two 
largest ones must be equal. If ai — ae = 03 — ag then ai +ag = 03 + 05 which, 
together with (21), implies that A contains the nice configuration (3). Oth- 
erwise, up to symmetry, we may assume that oi — og = 02 — 07 > 03 — og. It 
is then easily checked that one of the following three possibilities must hold : 

(a) ai — 05 = 03 — ag. This, together with (21), implies that A contains 
the nice configuration (4). 

(b) ai — 05 = 02 — ag. Thus in A we have that 

ai — 02 = 05 — ag = ag — 07 = 07 — ag. (22) 

(c) a2 — ag = as — ag. Thus in A we have that 

ai - a2 = ag - a7 = a7 - ag = -(a2 - 03). (23) 

This completes the analysis of Case I. We have shown that, under these 
circumstances, either there is a nice S-class or, up to symmetry, A contains 
one of the configurations (22) and (23) . 
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Next consider Case II. We claim that in this case, A cannot be an MSTD 
set. Suppose otherwise. A priori, cr < 36 so (2) forces 6 < 17. Now this plus 

(19) and the assumption that n2 = 1 mean that Y,i=i ^ ^ > H- Then 

(20) implies that a < 30. The idea is now to iterate this kind of argument 

to gradually reduce 5 until we obtain the contradiction that S < 7. 

If (J < 30 then (2) forces 5 < 14. Then (19) and the assumption that 

n2 = 1 force Ei=i ( 2* ) - ^^^^ ^^^^^^ ^ - 

If (7 < 28 then (2) forces S < 13. Then (19) and the assumption that 

n2 = 1 force Ei=i ( 2* ) - ^^^^ ^^^^ ^^^^^^ ^ - 

If (T < 26 then (2) forces S < 12. Then (19) and the assumption that 

n2 = 1 force J2i=i ( 2' ) " ^^^^ ^'^^^ ^^^^^^ ^ - 

If (T < 25 then (2) forces 6 < 11. Then (19) and the assumption that 

n2 = 1 force Ei=i ( 2' ) - ^^^^ ^^^^ ^ - 

If (7 < 23 then (2) forces S < 10. Then (19) and the assumption that 

na = 1 force Ef=i ( 2' ) - ^^^^ ^^^^ ^ - 

If 0" < 21 then (2) forces 5 < 9. Then (19) and the assumption that 

n2 = 1 force Ei=i ( 2* ) - ^^^^ ^^^^ ^ - 

If cr < 18 then (2) forces S < 8. Then (19) and the assumption that 

n2 = 1 force Ei=i ( 2' ) - '^^^^ ^^^^ ^^^^^^ ^ - 

If (7 < 14 then (2) forces S < 6 and we have our desired contradiction. 

This completes the analysis of CASE II. We have thus shown that, if A 
is an MSTD set then, up to symmetry, either it contains one of the 16 nice 
configurations (3)-(18), or one of the configurations (22) and (23). Thus the 
proof of Theorem 1 is reduced to 18 possible cases, as claimed, and we have 
completed Part One of the proof. 

Part Two 

If A contains a nice configuration, then the differences Oj — Oj+i can be 
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represented with vectors in R^. If either (22) or (23) hold, then it suffices 
with R^. We can then write a program to search for normahsed MSTD 
sets as outlined above. The rather unwieldy mathematica code for such a 
program, with each of the 18 possible input configurations, can be obtained 
from the author. The total running time for the program on our network 
was about 15 hours. The only outputted MSTD sets were Ai and 14 — Ai, 
thus proving Theorem 1. 



If A is an MSTD set of size 9, then the same type of reasoning as above 
can be used to show that either there is a nice S-class or A contains one 
of a small number of other possible configurations analogous to (22) and 
(23). However, a nice E-class only allows us now to represent the differences 
Qi — a^+i, a priori, as vectors in R^. Plus, the number of possible configu- 
rations is now much larger. The resulting computation was not practically 
feasible with our code, so we have not obtained with certainty a full classifi- 
cation of all MSTD sets of size 9, up to linear transformations. Instead, we 
classified all such sets in which some S-class contains at least 4 elements. 
One readily verifies that this leaves, up to symmetry, 25 possible configura- 
tions in A analogous to those in eqs.(3)-(18). The computer then located, 
over a period of about two weeks and after multiple crashes, the nine nor- 
malised MSTD sets A^., A4, A^, ... listed in the introduction. We now present 
a sequence of constructions of infinite families of MSTD sets which include 
five of these nine examples : 



Theorem 2 Let n > 3, < k < n - 2. Let d > 1 be a divisor of 2"^'^ - 1. 
Set 



3. Explicit families of MSTD sets 



X := {2" - 2-' : < j < n}, 
m:= (2"+^-^d) + (2"-2^'), 



Y ■.= m-X. 




B := XUYUZ, 
a := 2", 
A :=SU{a}. 



Then A is a normalised MSTD set with \A + A 



A-A\ + l. 
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Proof : One may verify the following facts : 

(i) i? is a symmetric set, B = m — B. 

(ii) A- A = B - B. 

(iii) A + A={B + B)U {2a}. 

The verifications involve calculations similar to those appearing in the proofs 
of Theorems 1-4 in [3] , hence are omitted. 

Remark : The set is the case n = 3, A; = l,(i = 3of the above theorem. 

In the notation of Theorem 2, when A: = there is a simpler, similar con- 
struction of a family of normalised MSTD sets : basically the arithmetic 
progression Z is not needed. 

Theorem 3 Let n > 3 and 1 < I < n - 2. Set 

X := {2" - 2-'' : < i < n}, 
m := (2"+i - 1) + (2^^ - 2'), 
Y:=m-X, 
B := XUY, 
a := 2", 
A := BU{a}. 

Then A is a normalised MSTD set with \ A + A\ = \A - A\ + 1. 

Proof : As for Theorem 2. Observe that ^9 is the case n = 3, Z = 1 
of the theorem. 

Next we turn to a generalisation of As : 

Theorem 4 Let n,d> 1. Set 

X := {jd : < J < n}, 
m := {4n+ l)d-|- 1, 
Y ■.= m-X, 
Z := {2nd, {2n + l)d], 
B := XUYUZ, 
a := 2nd, 
A:=BU{a}. 
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Then A is a normalised MSTD set with \ A + A\ = \A - A\ + 1. 

Proof : One may verify that B is symmetric, that A — A = B — B and 
that A + A={B + B)U {2a}. 

Remark : The set ^8 is the case n = d = 2 oi this theorem. 

Next we turn to Aq. Note that Aq = AiU {16} and that it does not have 
a symmetric subset of size 8. The following generalises Aq in a different 
direction to the generalisation of Ai given in [3] : 

Theorem 5 Letn>2, k>3. Set 

X := {2j : < i < n}, 
m := 2{k + l)n - 2, 
Y :=m-~ X, 
Z := {2jn -1:1< j<k}, 
B --XUYUZ, 
a* := 2n, 
A* --BU {a*}, 
a := m + 2, 
A:=A*U {a}. 

Then A* and A are both MSTD sets and \A* + A*\ - \A* - A*\ = \A + A\ - 
\A-A\ = 1. 

Remark : The sets A in Theorem 5 provide explicit examples of MSTD 
sets which are not obtained from a symmetric subset by adding a single 
element, followed possibly by a base expansion. Indeed S is a maximal 
symmetric subset of A and = 2. Finally note that the set Aq is the 

case n = 2, k = 3 oi the theorem. 

Proof of Theorem 5 : One may verify the following facts, from which 
the theorem follows : 

(i) B is symmetric. 

(ii) A* - A* = B - B. 

(iii) A* + A* = {B + B)U {2a*}. 

(iv) A-A={A*-A*)U {±a, ±[a - (a* - 1)]}. 
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(v) ^ + ^ = {A* + A*) U {2a, 2a - 2, a + {2kn - 1), o + a*}. 



Our fourth construction is a novel generalisation of the set A4, different 
from that covered by Theorem 1 of [3] : 

Theorem 6 Let n,k > 1. Set 

X:= [0,n], 
m := {2k + 3)n, 
Y :=m-X, 
Z:=U^ti[2in + l,(2i + l)n-l], 
B := XUY UZ, 
a := 2n, 
A :=BU{a}. 

Then A is an MSTD set with \A + A\ - \A - A\ = 1. Furthermore, set 

W := [{2k + i)n,{2k + 5)n), 
A:=AUW. 

Then A is also an MSTD set and |.4 + ^| - |^ - ^| = 2. 

Proof : One may verify the following facts, from which the theorem 
follows : 

(i) B is symmetric. 

(ii) A-A = B-B. 

(iii) A + A = {B + B)U {2a}. 

(iv) A-A={A-A)U ±[{2k + 3)n + 1, (2A; + 5)n - 1]. Hence \A-A\ = 
\A-A\ + {4n-2). 

(y) A + A= {A + A)U [{Ak + 6)n + 1, {4k + 10)n - 2]. Hence |^ + ^| = 
l^ + ^l + (4n- 1). 

Remarks : (a) In the case n = k = 2, the set A coincides with and 
the set AisAiU {16, 17} = {0, 1, 2, 4, 5, 9, 12, 13, 14, 16, 17}. Denote this set 
by A12 for future reference. 

(b) The sets A provide an explicit infinite family of examples of MSTD sets 
in which the size of the sumset is at least two more than the size of the 
difference set, and which are not obtained from a fixed MSTD set by either 
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a base expansion or by the method of Lemma 7 below (the latter method is 
implicit in [3]). 

(c) The set A2 has as a superset A13 = A2U {18, 19, 20} which also satisfies 
|Ai3 + A13I — |yli3 — ^isl = 2. This set is not, however, covered by Theorem 
6, and we have not spotted a way to generalise either ^13 or A2 itself. 

(d) Given an MSTD set M, let s{M) denote the maximal size of a symmet- 
ric subset of M. In Theorem 5, we gave examples in which \M\ — s(M) = 2. 
All our examples prior to that satisfy |M| — s{M) = 1. 

Now consider the sets A in Theorem 6. Each A has the symmetric subset 
B and \ A\B\ = n + 1. Let C = {n, {2k + 3)n} U {Z\[2n + 1, 3n - 1]). Then 
A\C is also symmetric and |^\C| = (A; — l)(n — 1) + 2. It is easily checked 
that A has no symmetric subset which is larger than both B and A\C. Thus 
we get examples of MSTD sets M in which the difference |M| — s{M) can 
be made arbitrarily large. The sets constructed in the proof of Theorem 
8 below provide further examples of this phenomenon. Again, the point is 
that in both cases, these examples are not base expansions of a fixed set. 

4. How much larger can the sumset be ? 

Starting from any fixed MSTD set, the base expansion method allows one 
to construct MSTD sets A for which the quotients \A + A\/\A — A\ become 
arbitrarily large. Here we are interested in both the quotient |y4 + ^|/|A — ^| 
and the difference |^ + ^| — |^ — ^|. Our first step is to describe a way to 
identify an MSTD subset ^ of Z with MSTD subsets of Z/nZ for suitable 
n depending on A. This idea is also implicitly contained in [3]. 

Let A be a normalised MSTD subset of Z, m := max{a G A}. Then 
A can be considered as an MSTD subset of Z/nZ for any n > 2m. More 
generally, we make the following definition : 

Definition : Let ^ be a normalised subset of Z (not necessarily an MSTD 
set) with largest element m, and n > an integer. Set A' := A n [0,n) 
and identify A' with a subset of Z/nZ in the natural way. We call A' 
the reduction of A modulo n. Then A is said to be reducible modulo n if 
A = {x e [0,m] : X (mod n) G A'}. If, in addition, A' is an MSTD set in 
Z/raZ, we say that A has good MSTD-reduction modulo n. 

The important observation, which is basically a reformulation and sharp- 
ening of Theorem 8 of [3] in the case of cyclic groups, is the following : 

Lemma 7 Let A be a normalised subset of Z and n > an integer. Suppose 
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A is reducible modulo n with reduction A' . Let 

fin, A) := \A' + A'\ - \A' - A'\. 

Put B := A(J {A + n). Then B is a normalised subset of Z and 

\B + B\-\B-B\ = {\A + A\-\A-A\) + 2- f{n, A). (24) 

Proof : Let X be any congruence class of integers modulo n. The re- 
ducibility of A modulo n implies the following : If xi,X2 E X H A + A (resp. 

XnA-A), and X3 G {xi,X2)nX, then X3 e X riA + A (resp. XCiA-A). 
Furthermore, X D A + A (resp. X Ci A — A) is non-empty if and only if 
XnB + B (resp. X n B - B) is. 

Suppose X D A + A is non-empty with largest element x. It follows 
that XnB + B = {X n A + A) U {x + n, x + 2n}. Similarly, suppose 
X n A — A is non-empty with largest clement xq and smallest element xi. 
Then X n B - B = {X f] A - A) U {xq + n, xi - n}. 

Eq. (24) follows immediately from these observations. 

Each of the sets listed on page 4 has good MSTD reduction modulo n, 
for some n considerably less than 2 • max{a G A}. For example, we can take 



^^ = A4\{12,13,14}, 


n 


= 12, 


/(12,A4) = 


1, 


A = 


n 


= 18, 


/(18,A2) = 


1, 


4 = A5\{16}, 


n 


= 16, 


/(16,^5) = 


1, 


K = ^6, 


n 


= 21, 


/(21,A) = 


2, 


= A7\{16, 18}, 


n 


= 16, 


/(16,^7) = 


1, 


^^ = A8\{17,19}, 


n 


= 17, 


7(17,^8) = 


1, 


A'9 = ^9\{17,21}, 


n 


= 17, 


7(17,^9) = 


1, 


Aio = Aio\{17,22}, 


n - 


= 17, 


/(17,Aio) = 


1, 


^;i=^ii\{21,25}, 


n ■ 


= 21, 


/(21,^ii) = 


1. 



More generally, in the notation of the theorems above : 

Theorem 2 : For given n, /c, d the set A has good MSTD reduction mod- 
ulo r := (2"+i + d) + (2"-i - 2^) and /(r, A) = 1. 

Theorem 3 : For given n, Z the set A has good MSTD reduction modulo 
r ■= (2'*+i + (2"-i - 2O and /(r, A) = 1. 

Theorem 4 : For given the set A has good MSTD reduction modulo 
r := And -\- 1 and /(r, ^4) = 1. 



15 



Theorem 5 : For given n,k the set A has good MSTD reduction modulo 

r := 2{k + 2)n + 1. If A; = 3 then /(r, A) =2. If A;> 3 then /(r, A) = 1. 
Theorem 6 : For given n, k the set A has good MSTD reduction modulo 
r := {2k + 2)n and /(r, A) = 1. Note that, in fact, A + A (mod r) = Z/rZ 
and ^ - ^ (mod r) = Z/rZ\{r/2}. 

The rather tedious proofs of these statements are omitted. Instead we note 
that of particular interest is the fact that /(21, ^g) > 1- This means that in 
repeatedly applying Lemma 7, the difference in size between the sum- and 
difference sets will grow more quickly. Let 

Au ■■= Aq U {Aq + 21) U (^6 + 42) 

= {0, 2, 4, 5, 9, 12, 13, 14, 16, 21, 23, 25, 26, 30, 33, 34, 35, 37, 42, 44, 46, 47, 51, 54, 55, 56, 58}. 

Then \Ai4 + Ai4^\ = 114 and [^14 — Ai4\ = 105, so in the notation of (1), we 
have /(A14) = ElM = 1.01767.... Thus /(A14) > /{As). By the way note 
that ^3 = A12 U {A12 + 12). 

The following example does even better : Let 

X := {0, 1, 2, 4, 5, 9, 12, 13, 17, 20, 21, 22, 24, 25}. 

(this set appears in [5], where it is denoted S4). Then — |X — X| =4, 

X is reducible modulo 20 and /(20, X) = 2. Take 

Ai5 ■.= X\J{X + 20) 

= {0, 1, 2, 4, 5, 9, 12, 13, 17, 20, 21, 22, 24, 25, 29, 32, 33, 37, 40, 41, 42, 44, 45}. 

Then 1^15 + Aisl = 91 and |^i5 - ^isl = 83, so /(^is) = e§ = 1-02082.... 

Our final result resolves in the affirmative Conjecture 20 of [5] : 

Theorem 8 Let j, k be any two non-negative integers. Then there exists 
a constant Cjj^ G (0, 1) such that, for all n sufficiently large, depending 
on j and k, at least Cj^k ■ 2"^^ of the subsets A of {0, 1, 2, n} satisfy 
\A + A\ = (2n + 1) - j and \A - A\ = (2n + 1) - 2k. 

Proof : The proof is an extension of the method of [5] and consists of 
two separate constructions depending on the sign of j — fc. 
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Case I : j <k. 

We make use of the set Au = {0,1,2,4,5,9,12,13,14,16,17} above. It 
satisfies A12 + ^12 = [0, 34] and A12 - A12 = [-17, 17]\{±6}. Furthermore, 
this set has good MSTD reduction modulo 12. 
Let j, k be given. Set 

k-l 

Xk := U ^12 + 12t. 

t=o 

Let ruh = maxXfc = 12A; + 5. By the proof of Lemma 7, we have that 
Xk + Xk = [0, 2mfe] and \Xk — Xk\ = {2mk + 1) — 2/c, since the difference set 
misses all numbers congruent to 6 (mod 12) in the interval [— mfe,mfe]. Now 
set 

Xj,k ■= Xk\{l2{k -t) + l:0<t<j}. 

One readily checks that Xj^^ — Xj^k = Xk — Xk but that Xj_k + Xj^k = 
[0, 2m]\{24(A; — t)+3:0<t< j}. Thus the sets Xj^k already provide 
explicit examples of sets satisfying the requirements of Theorem 8. To prove 
the existence of positive constants Cj^k we use the method of [5]. Set Lj^k '■= 
{mk-Xj^k)^{mk, 2mfe] and Uj^k ■= n-{XkU{mk, 2mk]), for any n > 4mfe+l. 
Finally put 

^j,k '■= Lj^k U -R U Uj^k, 

where R is the random subset of (2mfc, n — 2mk) obtained by choosing each 
number in the interval independently with probability 1/2. Our choices of 
Lj^k and Uj^k imply that 

#{([0, 4mfe] U [2n - 4mk, 2n])\(A,- ^ + A,- fe)} = j, 
#{[n - 4mk, n]\{Aj^k - = ^■ 

It then suffices to apply the same type of argument as in [5] to show that, 
with high probability, both (4mfe,2n — 4mfc) C Aj^k+Aj,k and [0,n — 4mfc) C 
Aj^k — Aj,kj and thus deduce the existence of a constant Cj^k > 0- 

Case II : j > k. 

We start by describing, for each j > 0, an integer mj > — 1 and a sub- 
set Lj C [0, rrij] with the following properties : 
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(i) Lj + Lj = [0, 2mj]\{2.Ti, 2xj} where < xi < • • • < Xj < mj/2. 

(ii) None of the numbers 2xu — x^, foi 1 < u,v < j, is in Lj. 

Set mo := —1, Lq := (j). This is consistent with (i) and (ii) above. We 
define the numbers mi < m2 < ■ ■ ■ and the sets Li C L2 C ■ ■ ■ inductively 
as follows : for each i > 0, 

mj+i := 3mi + 16, 

Lj+i := Li U {{mi + 1) + {0, 1, 2, 5, + 10, + 11, 2mi + 15}} . 

Each mj is an odd number and one readily checks that 

mj = 7- 3^ - 8, (25) 

\Lj\ = l{S'-l)+2j, (26) 

Lj + Lj = [0, 2mj]\{2xt = 2{mt + 5) -.0 <t < j}. (27) 

It is then a simple exercise to show that conditions (i) and (ii) above on Lj 
are satisfied. 

Now let j, k be given with j > k and first suppose A; > 0. Set 

Lj^k := Lj U {2xt+i - xt : 1 <t < j - k}, 

and observe that the numbers xt grow sufficiently fast so that for any k, 
Lj^k + Lj^k = Lj + Lj . For any n > 2m j + 1 set 

Uj^k = Uj := n - (Lj U {xi, x^}). 

Then, as usual, take finally Aj^k ■= Lj^k U i? U Uj^ki where R is the random 
subset of {mj,n — mj) obtained by choosing each number independently 
with probability 1/2. One easily checks that 

([0, 2m,] U [2n - 2m,-, 2n])\(A,- ^ + Aj^k) = {2xt : t = 1, j}, 
and [n — 2mj,n]\{Aj^k — ^j,k) = {n — 1 — 2xt : t = 1 or j — k + 2 < t < j}. 

18 



By the method of [5] both {2mj, 2n — 2mj) C Aj^^ + ^j,fe and [0, n — 2mj) C 
Aj^k — Aj^k occur with high probabihty, from which we deduce the existence 
of a positive constant c^.^. 

When k = we just have to be a little careful. One may check that the 
following choices work : 

Ljfi := Lj+2 U {xi} U {2xt+i - Xt : I < t < j + 1}, Ujfi := Uj+2, 

and thus the proof of Theorem 8 is complete. 

Remarks : (a) In each of the above constructions we get explicit exam- 
ples of sets satisfying the requirements of Theorem 8 by taking the set R to 
consist of the entire interval over which it is defined. These examples are 
not covered by any of the constructions in Section 3, and in particular that 
in Case II is somewhat different from any of the earlier ones, 
(b) Prom the above proof we obtain estimates of the form Cj^k = i^(2~®('"')), 
where / = max{j, k}. The numbers irii grow linearly when j < k hnt expo- 
nentially when k > j. This lack of symmetry is unsatisfying, though in any 
case the estimates are likely to be way smaller than the truth for all values 
of j and k. 

5. Concluding remarks 

We have provided various explicit constructions of non-trivial infinite fam- 
ilies of MSTD sets. All our constructions, including those in the proof of 
Theorem 8, and in common with those already in the literature, are roughly 
based on some type of symmetric set which is 'perturbed' slightly by adding 
on a small number of elements. Often, though not always, the symmetric set 
is itself constructed out of some (generalised) arithmetic progression. The- 
orems 2,3 provide an example where this is not so : there is an arithmetic 
progression Z in the former construction, but both are based on a geometric 
progression X. The sets constructed in the proof of Theorem 8 are also 
of a somewhat different character. Note that in Theorem 6, the set Z is 
a GAP of dimension 2. We have not investigated whether the arithmetic 
progressions appearing in our various constructions can be replaced, as in 
[3], by GAP:s of higher dimension. More interesting, though, would be to 
have explicit examples of MSTD sets which are, in some meaningful sense, 
'radically' different from the blueprint of a perturbed symmetric set. The 
ubiquity of MSTD sets, as exhibited by probabilistic techniques to which 
Theorem 8 is our contribution, mean that such examples should/must exist. 



19 



Note, by the way, that we have been unable to provide non-trivial families 
of MSTD sets which generalise the sets A2 and Aiq. 

Otherwise, the outstanding open problem seems to us to be to obtain 
a reasonably tight estimate for L := limsup^^^^^ f{A), where / is the 
function of eq.(l). The set A15 leaves the yawning gap ^ L < ^. 
Finding good estimates for the constants Cj^^ in Theorem 8 is also a very 
appealing problem and several questions in a similar vein are suggested by 
the material in [5]. 

Finally, one would obviously like to be able to complete the classification, 
up to linear transformation, of MSTD sets of size 9, and if possible extend 
the range of computation to larger sizes. An interesting question is to find 
the smallest no > such that there are infinitely many different normalised 
MSTD sets of size no- All our constructions, plus those of [3], provide 
only finitely many sets of any given size. An upper bound is no = 16, for 
which we can argue as follows : If A,B C Z and C = A X B C Z^, then 
\C ±C\ = \A±A\-\B± B\. In particular, if A is an MSTD set and B 
has at least as many sums as differences, then C is an MSTD set in Z^. 
For suitable A, /i € N the map (a, 6) 1— > Aa + fib takes C to a set Cx^fj_ C Z 
for which |Ca,^ ± Ca,^| = \C ± C\ - this is just the base expansion method. 
If min{|A|, \B\} > 1 then there is a linear transformation taking Cx^^^^^ to 

Ai Ml 



Cx2,fi2 if and only if 



A2 1^2 



0. In particular, we get infinitely many 



different sets up to linear transformation, each of size Since there 

is an MSTD set of size 8, namely Ai, and we may take B = {0, 1} so that 
= \B — B\ = 3, then we may conclude that there are infinitely many 
normalised MSTD sets in Z of size 16. 
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